Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework
Identifieur interne : 000187 ( Main/Exploration ); précédent : 000186; suivant : 000188Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework
Auteurs : RONG HUANG [Japon] ; Palaiahnakote Shivakumara [Japon] ; YAOKAI FENG [Japon] ; Seiichi Uchida [Japon]Source :
- IEICE transactions on information and systems [ 0916-8532 ] ; 2013.
Descripteurs français
- Pascal (Inist)
- Wicri :
- topic : Vote.
English descriptors
- KwdEn :
Abstract
To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single character, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and be comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000042
- to stream PascalFrancis, to step Curation: 000726
- to stream PascalFrancis, to step Checkpoint: 000030
- to stream Main, to step Merge: 000190
- to stream Main, to step Curation: 000187
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">13-0328463</idno>
<date when="2013">2013</date>
<idno type="stanalyst">PASCAL 13-0328463 INIST</idno>
<idno type="RBID">Pascal:13-0328463</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000042</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000726</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000030</idno>
<idno type="wicri:doubleKey">0916-8532:2013:Rong Huang:scene:character:detection</idno>
<idno type="wicri:Area/Main/Merge">000190</idno>
<idno type="wicri:Area/Main/Curation">000187</idno>
<idno type="wicri:Area/Main/Exploration">000187</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework</title>
<author><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Graduate School and Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s2>Fukuoka-shi, 819-0395</s2>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
<imprint><date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEICE transactions on information and systems</title>
<title level="j" type="abbreviated">IEICE trans. inf. syst.</title>
<idno type="ISSN">0916-8532</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Aspect ratio</term>
<term>Character recognition</term>
<term>Heuristic method</term>
<term>Localization</term>
<term>Multiple image</term>
<term>Open market</term>
<term>Operator</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Refinement method</term>
<term>Segmentation</term>
<term>Speech recognition</term>
<term>State of the art</term>
<term>Voting</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Opérateur</term>
<term>Reconnaissance optique caractère</term>
<term>Image multiple</term>
<term>Méthode heuristique</term>
<term>Segmentation</term>
<term>Rapport aspect</term>
<term>Méthode raffinement</term>
<term>Marché concurrentiel</term>
<term>Localisation</term>
<term>Reconnaissance parole</term>
<term>Evaluation performance</term>
<term>Etat actuel</term>
<term>Vote</term>
<term>Reconnaissance forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Vote</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">To handle the variety of scene characters, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an Optical Character Recognition (OCR) module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant detection/recognition and pads the missing detection/recognition. The proposed framework bridges the gap between scene character detection and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the detection and recognition at the character level, which enables dealing with special scenarios such as single character, text along arbitrary orientations or text along curves. We perform experiments on the benchmark ICDAR 2011 Robust Reading Competition dataset which includes a text localization task and a word recognition task. The quantitative results demonstrate that multiple hypotheses outperform a single hypothesis, and be comparable with state-of-the-art methods in terms of recall, precision, F-measure, character recognition rate, total edit distance and word recognition rate. Moreover, two additional experiments are conducted to confirm the simplicity of parameter setting in this proposal.</div>
</front>
</TEI>
<affiliations><list><country><li>Japon</li>
</country>
<region><li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement><li>Fukuoka</li>
</settlement>
<orgName><li>Université de Kyūshū</li>
</orgName>
</list>
<tree><country name="Japon"><region name="Kyūshū"><name sortKey="Rong Huang" sort="Rong Huang" uniqKey="Rong Huang" last="Rong Huang">RONG HUANG</name>
</region>
<name sortKey="Shivakumara, Palaiahnakote" sort="Shivakumara, Palaiahnakote" uniqKey="Shivakumara P" first="Palaiahnakote" last="Shivakumara">Palaiahnakote Shivakumara</name>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<name sortKey="Yaokai Feng" sort="Yaokai Feng" uniqKey="Yaokai Feng" last="Yaokai Feng">YAOKAI FENG</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000187 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000187 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:13-0328463 |texte= Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework }}
This area was generated with Dilib version V0.6.32. |